Towards a Better Exploitation of the Brown 'Family' Corpora in Diachronic Studies of British and American English Language Varieties

نویسنده

  • Sanja Stajner
چکیده

Since the 1990s, the Brown ‘family’ corpora have been widely used for various diachronic studies of 20th century English language. However, the existing methodologies failed to exploit its full potential as they only used the four main text categories. In this paper, we present the results of two experiments on diachronic changes of the Coleman-Liau readability Index (CLI) in British and American English in the period 1961–1991/2. The first experiment used all fifteen fine-grained text genres, while the second only used the four main text categories. The comparison of the results of these two experiments demonstrated the importance of using all fifteen finegrained text genres for obtaining a better understanding of how language changes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diachronic Stylistic Changes in British and American Varieties of 20th Century Written English Language

In this paper we present the results of a study investigating the diachronic changes of four stylistic features: average sentence length, Automated Readability Index, lexical density and lexical richness in 20th century written English language. All experiments were conducted on the largest existing diachronic corpora of British and American English – the Brown ‘family’ corpora, employing NLP t...

متن کامل

Using Comparable Corpora to Track Diachronic and Synchronic Changes in Lexical Density and Lexical Richness

This study from the area of language variation and change is based on exploitation of the comparable diachronic and synchronic corpora of 20th century British and American English language (the ‘Brown family’ of corpora). We investigate recent changes of lexical density and lexical richness in two consecutive thirty-year time gaps in British English (1931–1961 and 1961–1991) and in 1961–1992 in...

متن کامل

Exploring Male and Female Iranian EFL Learners’ Attitude towards Native and Non-native Varieties of English

This study investigated whether Iranian EFL learners are aware of different varieties of English spoken throughout the world and whether they have tendency towards a particular variety of English. Likewise, it explored the attitudes of Iranian EFL learners towards the native and non-native varieties of English. Moreover, it made an attempt to investigate whether such attitudes are gender-orient...

متن کامل

Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach

A syntactically complex text may represent a problem for both comprehension by humans and various NLP tasks. A large number of studies in text simplification are concerned with this problem and their aim is to transform the given text into a simplified form in order to make it accessible to the wider audience. In this study, we were investigating what the natural tendency of texts is in 20th ce...

متن کامل

Quantitative approaches to diachronic corpus linguistics

English Historical Linguistics has a rich and long-standing tradition of corpus-based work (cf. the surveys in Rissanen 2008, Kytö 2012). Resources such as the HELSINKI corpus, the BROWN family of corpora, and ARCHER have spawned active research programs for the study of lexical and grammatical change, both long-term (Curzan 2008) and short-term (Mair 2008). In addition, corpus resources inform...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011